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Abstract 

o 

P\) ■ The independence number of a sparse random graph G(n, m) of average degree d ~ 2m/n is 

well-known to be a{G{'n, m)) ^ 2n\n{d)/d with high probability. Moreover, a trivial greedy algo- 
Q ■ rithm w.h. p. finds anindependentset of size (l+o(l))-nln((i)/(i, i.e., half the maximumsize. Yetin 

^^ I spite of 30 years of extensive research no efficient algorithm has emerged to produce an independent 

set with (1 +e)n \n{d)/d, for any fixed e > 0. In this paper we prove that the combinatorial structure 

of the independent set problem in random graphs undergoes a phase transition as the size k of the 

independent sets passes the point k ^ n \n{d)/d. Roughly speaking, we prove that independent sets 

of size fc > (1 + e)n \Tv{d)/d form an intricately ragged landscape, in which local search algorithms 

are bound to get stuck. We illustrate this phenomenon by providing an exponential lower bound for 

the Metropolis process, a Markov chain for sampling independents sets. 

C/3 '. 
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1 Introduction and Results 

1.1 Probabilistic analysis and the independent set problem 

In the early papers on the subject, the motivation for the probabilistic analysis of algorithms was to 
alleviate the glum of worst-case analyses by a brighter 'average -case' scenario. This optimism was 
stirred by early analyses of simple, greedy-type algorithms, showing that these perform rather well on 
randomly generated input instances, at least for certain ranges of the parameters. Examples of such 
analyses include Grimmett and McDiarmid lfT3l (independent set problem), Achlioptas and Molloy Q 
(graph coloring), and Frieze and Suen ifTOl (/c-SAT). Yet, remarkably, in spite of 30 years of research, for 
many problems no efficient algorithms, howsoever sophisticated, have been found to outperform those 
simple greedy algorithms. 

The independent set problem in random graphs G{n, m) is a case in point. Recall that G{n, m) is 
a graph on n vertices obtained by choosing m edges uniformly at random (without replacement). We 
say that G{n, m) has a property with high probability if the probability that the property holds tends to 
1 as n — )■ oo. One of the eaiiiest results in the theory of random graphs is a non-constructive argument 
showing that for m = ^ (2) the independence number of G{n, m) is a{G{n, m)) ^ 2 log2(n) w.h.p. ||7] 
muni. Grimmett and McDiarmid |[T3l analyzed a simple algorithm that just constructs an inclusion- 
maximal independent set greedily on G(n, m): it yields an independent set of size (1 + o(l)) log2 n 
w.h.p., about half the maximum size. But no algorithm is known to produce an independent set of size 
(1 + e) log2 n for any fixed e > in polynomial time with a non-vanishing probability, neither on 
the basis of a rigorous analysis, nor on the basis of experiments or other evidence. In fact, devising 
such an algorithm is probably the most prominent open problem in the algorithmic theory of random 
graphs |l9l[T6l. (However, note that one can find a maximum independent set w.h.p. by trying all n*^(^°") 
possible sets of size 2 log2 n.) 

Matters are no better for sparse random graphs. If we let d = 2m/n denote the average degree, 
then non-constructive arguments yield a{G{n,m)) ~ — ^--^ • n for the regime 1 <^ d = o{n). In the 
case d S> y/n, the proof of this fact is via a simple second moment argument 15] HQ. By contrast, 
for 1 ^ d ^ y/n, the second moment argument breaks down and additional methods such as large 
deviations inequalities are needed [Si. Yet in either case, no algorithm is known to find an independent 
set of size (1 + e)^ • n in polynomial time with a non-vanishing probability, while 'greedy' yields an 
independent set of size (1 + o(l))^ • n w.h.p. In the sparse case, the time needed for exhaustive search 
scales as exp(^ ln(d) ln(n/d)), i.e., the complexity grows as d decreases. 

In the present paper we explore the reason for the apparent 'hardness' of finding large independent 
sets in random graphs. The focus is on sparse random graphs, both conceptually and computationally the 
most difficult case. We exhibit a phase transition in the structure of the problem that occurs as the size 
of the independent sets passes the point ^^ • n up to which efficient algorithms are known to succeed. 
Roughly speaking, we show that independent sets of of sizes bigger than (1+e)^ -n form an intricately 
ragged landscape, in which local-search algorithms are likely to get stuck. Thus, ironically, instead of 
showing that the 'average case' scenario is brighter, we end up suggesting that random graphs provide a 
rich source of worst-case examples. 

Taking into account the (substantially) different nature of the independent set problem, our work 
complements the results obtained in UJ for random constraint satisfaction problem such as fc-SAT or 
graph coloring and in [21] for and a wide class of symmetric CSPs. Furthermore, very recently B. 
Rossman |[23l obtained a monotone circuit lower bound for the clique problem on random graphs that is 
exponential in the size of the clique. The setup of |[23]| is somewhat orthogonal to our contribution, as 
we are concerned with the case that the size of the desired object (i.e., the independent set) is linear in 
the number of vertices, while Il23l deals with the case that the size of the chque is 0(1) in terms of the 
order of the graph. Nevertheless, the punchline of viewing random graphs as a source of computational 



hardness is similar. 



1.2 Results 



Throughout the paper we will be dealing with sparse random graphs where the average degree d = 2m /n 
is 'large' remains bounded as n — )• oo. To express this, we will refer to functions e^ that tends to zero as 
d gets largely In this case a{G{n^ m)) = (2 — e^)^ • n and the greedy algorithm finds independent sets 
of size (1 + ^'d)^^ ■ ™ w.h.p., where e^j ^'d ~^ ^^^ large d. However, no efficient algorithm is known to 
find independent sets of size (1 + e')^ " ™ for any^zxeJ e' > 0. 

For a graph G and an integer k we let Sk{G) denote the set of all independent sets of size exactly k. 



n. 



What we will show is that in G{n, m) the set Sk{G{n, m)) undergoes a phase transition as /c ~ -^ 
For two sets 5, T C V^ we let S'AT denote the symmetric difference of S, T. Moreover, dist(S', T) = 
|S'AT| is the Hamming distance of 5, T viewed as vectors in {0, 1} . 

To state the result for k smaller than ^n, we need the following concept. Let 5 be a set of subsets 
of V, and let 7 > be an integer. We say that S is ^-connected if for any two sets o", r G 5 there exist 
(Ti, . . . , (Tat G S such that cji = a, aj\]- = r, and dist{at, crt+i) < 7 for all 1 < t < N. If Sk{G{n, m)) 
is 7-connected for some 7 = 0(1), one can easily define various simple Markov chains on Sk{G) that 
are ergodic. 

Theorem 1 There is e,^ — t- such that Sk{G{n, m)) is 0(1) -connected w.h.p. for any 

, , . In d 

k < (1 - ^d)—f- -n. 

The proof actually yields an efficient algorithm for finding a path of length 0(n) between two given 
independent sets of size k w.h.p. 

By contrast, our next result shows that for k > ^ • n the set Sk{G{n, m)) is not just disconnected 
w.h.p., but that it shatters into exponentially many, exponentially tiny pieces. 

Definition 1 We say that Sk{G{n,m)) shatters if there exist constants 7, ^ > such that w.h.p. 
Sk{G{n, m)) admits a partition into classes so that: 

1. Each class contains at most exp {—"fn) \Sk{G{n, m))\ independent sets. 

2. If two independent sets belong to different classes, their distance is at least (n. 

Theorem 2 There is e^ — )• 50 that Sk{G{n, m)) shatters for all k with 

And , , And 

{l + ed)—-n<k<{2-£d)—-n. (1) 

Theorems [T] and [2] deal with the geometry of a single 'layer' Sk{G{n,m)) of independents of a 
specific size. The following two results explore if/how a 'typical' independent set in Sk{G{n,m)) can 
be extended to a lai^ger one. To formalize the notion of 'typical', we let Afc(n, m) signify the set of all 
pairs (G, a), where G is a graph on n vertices F = {1, . . . , n} and a of G is an independent set of size 
k. Let Uk{n, m) be the probability distribution induced on Afe(n, m) by the following experiment. 

Choose a graph G = G{n, m). 

If a{G) > k, choose an independent set a G Sk{G) uniformly at random. 



'The reason why we need to speak about d 'large' is that the sparse random graph G(n, m) is not connected. This implies, 
for instance, that algorithms can find independent sets of size (1 + ed)n\n{d)/d for some e^ — J- by optimizing carefully 
over the small tree components of G{n, m). Our results/proofs actually carry over to the case that d — d{n) tends to infinity 
as n grows, but to keep matters as simple as possible, we will focus on fixed d in this extended abstract. 



We say a pair {G,a) chosen from the distribution Uk (n, m) has a property V with high probability if the 
probabiUty of (G, a) ^ V tends to one as n — ;• oo. 

Definition 2 Let 7, 5 > 0, let G be a graph, and let a be an independent set ofG. We say that (G, a) is 
(7, (5)-expandable ifG has an independent set r such that \t\ > (1 + 7)|<7| and |t n (t| > (1 — 5)\a\. 



Theorem 3 There is e^ — )• such that for any e^ < e < 1 — £d the following is true. For k 

Inj 
d 



(1 — e)^ • n a pair {G, a) chosen from the distribution h(k{n, m) is (2e/(l — e),0)-expandable w.h.p. 



Theorem |3] shows that w.h.p. in a random graph G{n,m) almost all independent sets of size k 

mde( 
size ^ • n, i.e., as e — >■ 0, the typical potential for expansion shrinks. 



(1 — e)^ • n are contained in some bigger independent set of size (1 + e)^ • n. That is, they can be 
expanded beyond the critical size ^ • n where shattering occurs. However, as k approaches the critical 



Theorem 4 There is e^ — )• such that for e^ < e < 1 — e^ and /c = (1 + e) ^ • n w.h.p. a pair {G, a) 
chosen from the distribution Uk{n, m) is not (7, 6) -expandable for any j > Ed and 

In other words. Theorem |4] shows that for k = {1 + s)^f^ ■ n, a typical a G Sk{G{n, m)) cannot be 
expanded to an independent set of size (1 + 7)A;, 7 > e^^ without first reducing its size below 

(l-S)k = {l-e- 7(1 + e) + ^£d)—r- ■ n < — ■ n. 

(However, a random independent set of size fc < (2 — Sd) ln{d)n/d is typically not inclusion-maximal 
because, for instance, it is unlikely to contain all isolated vertices of the random graph G{n, m).) 

Metaphorically, the above results show that finding a maximum independent set in G{n, m) is like 
climbing a ragged mountain range in dense fog, in quest of the highest summit. Beyond the 'plateau 
level' k ~ ^ • n there is an abundance of smaller 'peaks', i.e., independent sets of sizes {1 + e)k, 
almost all of which are not expandable. Thus, ascending blindly, we are bound to get stuck way below 
the overall summit of height 2^ • n. Furthermore, once we find ourselves on one of the lesser peaks of 
height (1 + e)k, we will have to return all the way to plateau level to try anew. 

The algorithmic equivalent of such a mountaineer is a Markov chain called the metropolis process. 
For a given graph G its state space is the set of all independent sets of G. Let It be the state at time t. 
In step t + 1, the process chooses a vertex u of G uniformly at random. \fv € It, then with probability 
1/A the next state is It+i = It\ {v}, and with probability 1 — 1/A we let It-^i = It, where A > 1 is a 
'temperature' parameter. Ifv^ItU N{It), then I^+i = ItU {v}. Finally, if t; G N{It), then It^i = It. 
The probability of an independent set S* of G in the stationary distribution equals X'^'/Z{G, A), where 

n 

Z(G,A) = J]A^.|5fc(G)| 

A;=0 

is the partition function. Hence, the larger A, the higher the mass of large independent sets. Let 



/i(G,A) = ^A:A^--|5fc(G)|/Z(G,A) 



k=0 



denote the average size of an independent set of G under the stationary distribution. 

Our above results on the structure of the sets Sk{G{n, m)) imply that w.h.p. the mixing time of the 
metropolis process is exponential if the parameter A is tuned so that it tries to ascend to independent sets 
bigger than ^ • n. 



Corollary 1 There is e^ — )• such that for A > 1 with 

(1 + erf)^ • n < E [fi{G{n, m), A)] < (2 - Sd)^ ■ n. (2) 

the mixing time of the metropolis process on G{n, m) is exp(r2(n)) w.h.p. 

1.3 Related work 

To our knowledge, the connection between transitions in the geometry of the 'solution space' (in our 
case, the set of all independent sets of a given size) and the apparent failure of local algorithms in finding 
a solution has been pointed first out in the statistical mechanics literature ifTSlfTSlfTTI . In that work, 
which mostly deals with CSPs such as fc-SAT, the shattering phenomenon goes by the name of 'dynamic 
replica symmetry breaking.' Our present work is clearly inspired by the statistical mechanics ideas, 
although we are unaware of explicit contributions from that line of work addressing the independent set 
problem in the case of random graphs with average degree d » 1. Generally, the statistical mechanics 
work is based on deep, insightful, but, alas, mathematically non-rigorous techniques. 

The recent work in Q, on recontruction thresholds on weighted independent sets on trees, predicted 
some of the results of this work on the basis of the heuristic connection between reconstruction on 
trees and clustering of solution space of r-CSPs. Our result here come as a veryfication of all of these 
predictions. 

In the case that the averge degree d satisfies d » y/n, the independent set problem in random graphs 
is conceptually somewhat simpler than in the case of d = o{y/n). The reason for this is that for d ^> y/n 
the second moment method can be used to show that the number of independent sets is concentrated 
about its mean. As we will see in Corollary |2]below, this is actually untrue for sparse random graphs. 

The work that is most closely related to ours is a remarkable paper of Jerrum |[T4l . who studied the 
metropolis process on random graphs G{n, m) with average degree d = 2m /n > r?!"^ . The main result 
is that w.h.p. there exists an initial state from which the expected time for the metropolis process to find 
an independent set of size (1 + e) ^ • n is superpolynomial. This is quite a non-trivial achievement, as it 
is a result about the inital steps of the process where the states might potentially follow a very different 
distribution than the stationary distribution. The proof of this fact is via a concept called 'gateways', 
which is somewhat reminiscent of the expandability property in the present work. However, Jerrum's 
proof hinges upon the fact that the number of independent sets of size /c ~ (1 + e) ^ • n is concentrated 
about its mean. The arguments from the present work can be used to extend Jerrum's result to the sparse 
case, where the expected time until a large independent set is found is fully exponential in n w.h.p., via 
Theorem [5] below. (Instead of showing this, we chose to present Corollary [TJ which just yields a bound 
on the mixing time of the Metropolis process.) 

For extremely sparse random graphs, namely d < e ?» 2.718, finding a maximum independent set in 
G{n, m) is easy. More specifically, the greedy matching algorithm of Karp and Sipser ITTl can easily 
be adapted so that it yields a maximum independent set w.h.p. But this approach does not generalize to 
average degrees d > e (see, however, |[T2l for a particular type of weighted independent sets). 

The results of the present paper extend the main results from Achlioptas and Coja-Oghlan [H, which 
dealt with constraint satisfaction problems such as A;-SAT or graph coloring, to the independent set 
problem. This requires new ideas, because the natural questions are somewhat different (for instance, 
the concept of 'expandability' has no counterpiece in CSPs). Furthermore, in OJ we conjectured but did 
not manage to prove the counterpiece of Theorem [T] on the connectivity of 5fc(G(n, m,)). 

On a technical level, we owe to HI the idea of analyzing the distribution Uk{n, m) via a different 
distribution Vk{n, m), the so-called 'planted model' (see Section|2]for details). However, the proof that 
this approximation is indeed valid (Theorem |5] below) requires a rather different approach. In HI we 
derived the corresponding result from the second moment method in combination with sharp threshold 



results. By contrast, here we use an indirect approach that reduces the problem of estimating the number 
\Sk{G{n,m))\ of independent sets of a given size to the problem of (very accurately) estimating the 
independence number a{G{n,m)). Indeed, the argument used here carries over to other problems, 
particularly random fc-SAT, for which it yields a conceptually simpler proof than given in UJ- 

2 Approaching the distribution Uk {n, m) 

2.1 The planted model 

The main results of this paper deal with properties of 'typical' independents sets of a given size in a ran- 
dom graph, i.e., the probability distribution Uk{n, m). In the theory of random discrete structures often 
the conceptual difficulty of analyzing a probability distribution is closely linked to the computational 
difficulty of sampling from that distribution (e.g., Il22l Chapter 9]). This could suggest that analyz- 
ing Uk{n,m) is a formidable task, because for A; > (1 + e)n\Yi{d)/d there is no efficient procedure 
known for finding an independent set of size /c in a random graph G{n,m), let alone for sampling one 
at random. In effect, we do not know of an efficient method for sampling from Uk{n, m). 

To get ai^ound this problem, we are going to 'approximate' the distribution Uk{n,m) by another 
distribution Vkin^m) on Kk{n,m), the so-called planted model, that is easy to sample from. This 
distribution is induced by the following experiment: 

Choose a subset a C [n] of size k uniformly at random. 

Choose a graph G with m edges in which a is an independent set uniformly at random. 

Output the pair (G, a). 

In other words, the probability assigned to a given pair (Go, ctq) G Afc(n, m) is 

-1 



Ppfc(n,m) [(GcCTo)] 



n^ t,^)-{^] 



k I \ m 



(3) 



i.e., Vk{n, m) is nothing but the uniform distribution on Ak{n, m). The key result that allows us to study 
the distribution Uk{n, m) is the following 

Theorem 5 There is e^ — )■ such that for k < (2 — ed)n ln{d)/d we have 

PWfe(n,m) [£] < Ppfc(n,m) [<?] " exp f Un^ln^ d/d^ ] for any event £ C Afc(n, m). 

To establish Theorem|5l we need to find a way to compai^e Vk{n, m) and Uk{n, m). Suppose that 
A; < (2 — ed)n In((i)/(i is such that a{G{n, m)) > k w.h.p. Then the probability of a pair (Go, ao) G 
A/;(n, m) under the distribution Uk{n, m) is 



PWfe(n,m) [(CoifTo)] 



^^^l • |5fc(Go ' 
m I 



(4) 



(because we first choose a graph uniformly, and then an independent set of that graph). Hence, the 
probabilities assigned to (Go, oq) under ^ and ([3]l coincide (asymptotically) iff 



A moment's reflection shows that the expression on the r.h.s. of ^ is precisely the expected number 
E|5fc(G(ri, 7n))| of independent sets of size k. Thus, Vk{n,m) and Uk{n,m) coincide asymptotically 
iff the number \Sk{G{n, m))| of independents sets of size k is concentrated about its expectation. 



This is indeed the case in 'dense' random graphs with m » n^^"^. For this regime one can perform a 
'second moment' computation to show that \Sk{G{n, m))\ ~ E\Sk{G{n, m))\ w.h.p., whence the mea- 
sures Vk{n^ m) and h(k{n, m) are interchangeable. This fact forms (somewhat implicitly) the foundation 
of the proofs in |[T4l . 

By contrast, in the spai^se case m <^ n^/"^ a straight second moment argument fails utterly. As 
it turns out, this is because the number \Sk{G{n,m))\ simply does not concentrate anymore. In fact, 
maybe somewhat surprisingly Theorem |5] can be used to infer the following corollary, which shows 
that in sparse random graphs the expectation 'E\Sk{G{n,m))\ 'overestimates' the typical number of 
independent sets by an exponential factor w.h.p. 

Corollary 2 There exist functions e^ — )• and g{d) > such that for (1 + ed)nln{d)/dn < k < 
(2 — efi)nln{d)/d we have 

\Sk{G{n,m))\ < E\Sk{G{n,in))\ ■ exp{—g{d)n) w.h.p. 

Conversely, in order to prove Theorem |5] we need to bound the 'gap' between the typical value of 
|5fc(G(n, m))\ and its expectation from above. This estimate can be summarized as follows. 

Proposition 1 There is e^ — )• such that for k < (2 — ed)n ln{d)/d we have 



|5fc(G(n,m))| > E\Sk{G{n,m))\ ■ exp UnJln^d/d^ w.h.p. 



Using Proposition [H equations ^ and (01), and a double counting, one completes the proof of Theorem|5] 
fairly easily. 

3 Preliminaries 

In this section we present some preliminary, technical, results which aie going to be very useful in rest 
of this work. The first theorem we present is known as the Chemoff bounds. 

Theorem 6 Let Ii,l2 . ■ ■ ,In be independent Bernoulli trials such that Pr [Ij = 1] = pi, for 1 < i < n 

and < Pi < 1. Then, for X = X]"^^ ^i' ^ ~ ^i-^] ^^^ < 5 < 1 

Pr[X < (1 - 5)/i] < exp {-p5^/2) (6) 

and for < 5 < 2e — 1 

Pr[X > (1 + 6)^l] < exp (-/i^V^) ■ (V) 

Theorem 7 Let Ii,l2 ■ ■ ■ ,In be Bernoulli trials such that for 1 < i < n Pr [li = 1] = pi, where 
< Pi < 1. Furthermore, assume that the variables are negatively dependent, i.e. 

Pr[h = l\Ij = 1] < Pr[Ii = 1] for any i,j. 

For the variable Y = Y17=i ^i' ^^^ concentration bounds of Theorem^still apply. 

In many cases we can avoid a large deal of complex derivations if instead of G{n, m) we consider 
the following, closely related model of random graphs. Let G* (n, m) be random graph model defined by 
inserting into a set of n vertices m edges as follows: Each of the m edges chooses its incident vertices 
u.a.r. and with replacement. After inserting all the m edges, delete all the self loops and replace the 
multi-edges with a single edge. 

The model G*{n, m) is related to the model G(?i, m) the following lemmas describe. 



Lemma 1 Let Abe a decreasing graph property q For m = en, where c is a positive fixed constant, 
and for rn! = m{l — 5) it holds that 

Pr[G{n,m') e A]> Pr[G*{n,m) £ A] - exp {-m5'^ /17) 

for any < 5 < 1. 

Proof: Let X be the number of edges in G* (n, in) after removing self loops and multy-edges. For any 
descreasing graph property A, it holds that Pr[G*{n, m) G A\X > q] < Pr[G{n, q) G A]. Applying 
the law of total probability and the previous inequality we get that for any integer m' > it holds 

Pr[G*{n,m) e A] < Pr[G{n,m') G A] + Pr[X < m']. 

The lemma follows by deriving appropriate bounds for Pr[X < m'] when ni' = {I — 6)m. 

Consider the process that creates G* (n, m) and let Y be the number of edges that are not a self-loop 
or form multi-edges with some others (before the deletions). It always holds, Y < X, i.e. y is a lower 
bound on the number of edges in G* ^ after the deletion of multiple edges and the self loops. The 
lemma follows by deriving an appropriate concentration result for Y. 

Let Ij be the indicator random variable such that Ij = 1 if the edge j is placed such that it is not a 
self-loop or does not form multi-edge with some others, while Ij = otherwise. It is direct to get that 
E[Ij]>l-^. Also, it is direct that Y = Y,j Ij and S[y] > (1 - |^)m. 

Also, we should remark that the random variables Ij are negatively dependent with each other. Thus, 
by Theorem|7]we have that Pr\Y < (1 — 5)m] < exp (— m(5^/17). The lemma follows. 



Lemma 2 Let the variable Xk{G) denote the number of independent sets of the graph G. For nri 
<d{n), it holds that 

E[Xk{Gn,^)\ ~ E[Xk{Gn,m)]- 

Finally we present a lemma that it will be very usefull in the course of this paper. 

Lemma 3 (Expectation.) Let G* „ be of expected degree d = 2nn/n and for real e let 

2n 
k= — (lnd-lnln(i+l-ln2-e) 

IfXn^m{k) is the number of independent sets of size k in G* ^, then it holds that 

E[Xn,n.{k)] =expU(e- In (l - ^^^^h±:l+hl±l\ _ 1 (^/^) _ o(^/,,)2 ^^^^ 



Proof: Clearly it holds 



Let s = ^. We have that 



"■ \ 2\m 



E[Xn,Uk)] = (^]{l-{k/ny) 



In(^) = -n(s In s + (1 -s)ln(l - s)) + o(n) 
= ns{-lns + l- s/2-0{s'^)) + o{n) 
= A: (In d - In In d - In 2 + 1 - ln(l - q^) - kj2n) + 0{{k/nf)) 

where qd = ''"^""'inrf'"^"^^ - Using the fact that m = f re we get that 
ln(l-s2)"^ = -ns{ds/2 + ds^/4. + 0{ds^)) 

= -A: (Ind - Inlnd - ln2 + 1 - e + 0{\nd{k/nf)) . 
The lemma follows after straightforward calculations. 



^A graph property A is increasing iff given that A holds for a graph G(V, E), then A holds for any G(V, E'): E' I) E. 



4 Proof of Theorem IS 

Lemma 4 Suppose that there is a number q G [0, 1] such that P-Pf.in.m) [^] < 9- Then 

\{{G,a) G Afc(n,7n)nf}| < g • |Afc(n, m)|. 

Proof: It is direct that for every a C [n] of size k, the number of graphs that have independent set a 

is exactly i}-'^'^'^')- This implies that Vk{n,m) is the uniform distribution over all the members of 
Afc(n,?7i). The lemma follows. 

Proof of Theorem |5} Assume that a PK^(^n,m)[£] = C' where £ E Afc(n,m). Assume further that 
Pvk(n,m)[£] < g for some gG [0,1]- Proposition[T]implies that with high probability it holds |5fc(G„^m)| > 

1 ^^r^ I l4ni/ln^ d/d^ J E[Xn^rn{k)]. Thus, the assumption on Pij^(^n,m)[^] implies that 



2exp 



\{{G,a)eAkin,m)n£}\ > ^exp (-Un^Jln^ d/dA \Ak{n,m)\. 
Combining the above inequality with Lemma |4] we get that 

^exp(-Un^Jln^d/dA |Afc(n,m)| < \{{G,a) eAk{n,m)n£}\ <q\Ak{n,m)\. 



It is direct that it should hold C < 2g exp lin^/ln d/d^ . The theorem follows. 



4. 1 Proof of Proposition [T] 

Since the second moment method fails to yield a lower bound on the typical number \Sk{G{n, m))\ of 
independent sets, we need to invent a less direct approach to prove Proposition [T] Of course, the demise 
of the second moment argument also presented an obstacle to Frieze ISl in his proof that 

a{G{n,m))>{2-ed)nln{d)/d w.h.p. (8) 

However, unlike the number \Sk{G{n, m))\ of independent sets, the size a{G{n, m)) of the largest one 
actually is concentrated about its expectation. In fact, an arsenal of large deviations inequalities applies 
(e.g., Azuma's and Talagrand's inequality), and HI uses these to bridge the gap left by the second 
moment argument. Unfortunately, these large deviations inequalities draw a blank on \Sk{G{n,m))\. 
Therefore, we are going to derive the desired lower bound on \Sk{G{n, m))\ directly from ([S]). 

To simplify our derivations we consider the model of random graphs G* (n, m) and we show the 
following proposition. 

Proposition 2 There is e^ — ^ such that for k < {2 — ed)n ln{d)/d we have 



\Sk{G*{n,m))\ > E|5fc(G*(n,m))| • exp -Un\\ifd/d^ w.h.p. 



Then, Proposition [T] follows by Lemma[T] for (5 = n ^/^, and Lemma|2l 

Given some integer A; > and q G [0, 1], let Xn^m{k) = |cSfe(G* ^)| and let 

Mk,q = max{m G n\Pr[Xn,m{k) > 0] > 1 - g}. 



Lemma 5 Consider some integer k > 0, q £ [0, 1] and some m < Af^ g. It holds that 

Pr [Xn,m{k) <h-^- E[Xn,m{k)]] < 2q 

where h{n) > 2E[Xn,M,Jk)]. 

Proof: Consider the process where we have n vertices and at each step we add an edge by selecting 
u.a.r., with replacement, its endpoints. It is clear that after having inserted m edges and having deleted 
the multiple edges and selfloops, the resulting graph would be G* „. 

W.l.o.g. assume that m, k, q are such that m < M^ g. We look the above process at two specific 
moments, the first is exactly after having inserted the m-th edge and the second is exactly after having 
inserted the M^ ^-th edge. Let G* „ and G* j^ be the two graphs we see at these specific moments, 
after having deleted the multiple edges and selfloops, and let X„ m(A;) and Xn,A4 (k) be the number of 
independent sets of size k in each graph, correspondingly. It holds that 

E[Xn,M,Jk)\Xn,m{k)] = ^n,m(A:)(l " {kMY'"'"-"' . 

We let the event £i ="Xn,mik) < E[Xn^mik)]/h{n)" and assume that Pr[£i] = ( for some C G [0, 1] 
By the Markov inequality we get that 



Pr 



Xn,M,M<2 



E[XnAk)] 

h{n) 



(1 - {k/nYY'^-^^'^\8i 



The above inequality and the assumption that Pr[<Si] = C, imphes that 



Pr 



Xn,M,Jk)<2 



E[Xn,mik)] 

h{n) ' 



1 - {k/nf)^"-"-'^ 



> 1/2. 



>C/2. 



Since h{n) > 2E[Xn,Mk {k)], we get that Pr[Xn,Mk (k) < 1] > C/2- It is direct that it holds ( < 2q 



Thus, Pr[£i] < 2q. 



k.q ' 







Definition 3 For the integer k > let 

5k = 2{n/k) \n{n/k) + 2{n/k) - 0{\ii{n/k)^/{n/k)) 
Improving on the main Theorem in [8J we get the following theorem. 
Theorem 8 For the integer k and G*{n, m) of expected degree d < 5^ it holds that 

Pr[a{G*{n,m)) < k] < exp {-vhi^d/d^) . 



Also, for d = 5k it holds that E\Sk{Gn m)\ ^ ^xp I 14nwln^ d/d^ 



The proof of Theorem [8] appears in Section 1421 Proposition |2] follows by using Lemma |5] and taking 
Mk,q = -y^, the values of q and h{n) follow from Theorem[8] 

Recently, V. Dani and C. Moore O provided an improvement on the result of Theorem [S] In par- 
ticular, they showed that G* ^ of expected degree d < 2{n/k) \n{n/k) + 2{n/k) — 0{y^n/k) has an 
independent set of size k w.h.p. This new result can be used to improve slightly Theorem [S] 



4.2 Proof of Theorem |8] 

Definition 4 For G*{n, m) of averange degree d and for e G M /ef k^ denote the size of independent 
sets such that 

lim \nE[Xn,rn{ke)]/ke = e 

We should remark that setting fd{s) = lim„_>.oo In E[Xn,m{sn)]/{sn), where s £ (0, 1) it is direct to 
show that for a given d the function fd{s) is decreasing. In particular- it holds that 

Thus for e < e' we get that k^ > k^i. 



Lemma 6 Consider the graph G* ^ of expected degree d = 2m/n. For e > 7v/ ^^ it holds that 



P[a(G* )>A:,]>l-exp -e' 



n 



4Mln(i/ 

The proof of Lemma [6] appears in Section 1431 All the above imply that for G* ^ of expected degree d 
the size of the maximum independent set is w.h.p. at least kcr = k/ , -ijiA ^'^'^ 

E {Xn.^(k,r)\ < exp (un^Jln'' d/dA . 

Theorem [8] follows from Lemma|7j 

Lemma 7 The graph G* „^ of expected degree d with probability at least 1 — exp (— nln^ d/d^) has an 
independent set of size k if 



d < 2{n/k) \n{n/k) + 2{n/k) - 0(ln(n/A:) VK^) 

Proof: For given k, let s = k/n. In the derivations below assume that s is sufficiently small constant. It 
is direct to show that 

-\nE[Xnm{k)] = -h(s) + dln(l - s'^)/2 - (1 - o(l))lnn/2n, 
n 

where h{x) = x Inx + (1 — x) ln(l — x), for x £ [0, 1]. Let d^ = d'^{k) be such that 

-h{s) + d+ln{l-s'^)/2 = 0. 
It is direct to get that 

d+ = 2—^^^^ = 2s-i Ins-^ + 2s-i - 1 - 0(s Ins"^). 
ln(l — s^j 

Assume that we have a graph G* (n, tti') of expected degree dk = 2s~^ In s~^ + 2{s^^) — C In s~^ /^/s, 
where G is a sufficiently lai^ge constant. It is easy to check that it also holds that k = 2n(l — 
Odf. (1)) In dk/dk, where Od^ (1) is a quantinty that tends to zero as d^ increases. 

It holds that d'^ > dk- Add to G*{n,m') edges at random so as to increase the expected degree 
to (i+, i.e. we insert {d'^ — dk)n/2 edges at random. Each independent set in G*{n,m') is also an 

independent set of G*{n, m) with probability (l — s^y . Then it is easy to get that 

E\X^,^(k)\ = (\- s')^"^-^'-'^'^''' E{X^^^,{k)\. 
10 



Since E[Xn,m{k)] = G(n~^/2) we get that 

-lnE[X^^^,{k)] = ln(l-s2)(4_d-)/2-o(l) 
n 

1 S 

The lemma follows by using the fact that s = 2(1 — 0^(1)) In d/d and by taking sufficiently large C. 



4.3 Proof of Lemma |6] 

Lemma 8 (Second Moment Bound.) For the graph G* ^ with sufficiently large expected degree d 



2m /n and e > 3y In d/d it holds that 

( 571nii/2rf \ 
Pr[X„,„(fc,) > 0] > exp ^3 k, \ . 

Proof: It holds that 

The lemma will follow by bounding appropriately the r.h.s. of the above inequality. Note first that for 
such e it holds that E[Xn^m{ke)] = exp I 6nWln^ d/d^ ) . When there is no danger of confusion, we 
abreviate k^ to k. We have that 



where 



Let 



EjXl^ik)] ^ J^ {1){lrJ:){l-2ik/nr + ii/n)T ^ 
E^[Xr.,m{k)] to {l)i^-ik/nrr h 

(^)iTJ){^-nk/nr + {^/n)T 



ai 



ai 



il) (1 - (fc/n)2)2- 






ai {i + l){n-2k + i + l) \ ^ n"^ 

where (/j = (1 + 0(ln^ d/d"^)) > 1 is a quantity that tends to 1 with d. 

It is not hard to see that for small i, the sequence bi decreases with i because of the factor {i + 1) in 
the dominator. For intermediate i it grows due to the factor (1 + gj^^)™. Finally, when the difference 
k — i becomes small, bi declines. Thus, Oj achieve its largest value either at ii = min{i > l|6j < 1}, or 
at 12 = maxji < A;|6j > 1} + 1. 

Claim 1 Let s = k/n, then ii G [sk, sk{l + sln^/^ d)]. 

The proof the claim appears after the proof of this lemma. Furthermore, we show that 12 has a lower 
bound 22 = k (1 — y^k/n) (obviously 12 < k). More specifically, it holds that 

bi'^.i > (fc/n)2 exp (mHl^ + 0(m(2i'2)Vn4)) 

> 3^ exp (2 In d - 2 In In d + 2 - 2 In 2 - Od(l)) 

> 3^ exp (2 In d- 2 In In (i + 0.6) > 1. 
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Thus, i2 > ^2- ^^ f^' ^^ ^'h i^ regai'ded we get the following: 



©(l-(fc/n)2)™V (l-(A;/n)2) 

/fcx fn—k\ 
Kin) Kk—in) 



< 



E[Xn,mik)] 



^ 71— TmI^^P -63^7^77—7 7T^ 



{k-i'^y "^y (pn{k 



2) 



< ( ^j— exp(-2.121nd) 
\ln d 

It is direct that for sufficiently large d, it hold that 0,2 < o(n^^). Thus, the main contribution to J2i "« 
comes from the terms with indices around ii while the contribution of the terms with indices around 12 
is negligible. For s = k/n, it is direct that («i/n)^ < s^(l + 3s In^" d). It holds that 

Oil S (^ (v^^ 



1 + 



3.5ln3/2^\'"/' 



{1-s 



2\2 



< exp (s.Bs^dln^/'^ dn) < exp [ 56^^—3 — k 



The lemma follows by noting that 










Proof of Claim [TJ For x € (0, 1) let H{x) = xlnx + (1 — x)ln(l — x). Also, let k^^^ = sni = ak 
and m = |n, for < s, a < 1 fixed real. 

^w - ^"{(:)G)(r-i)(i-w»f+W"'^)1 

= -H{s) - H{a)s - H{s{l - a)/{l - s))(l - s) + f In (l - 2{s)^ + (as)^) + o(l). 

Taking the derivative of the function /, above, we get 

d .. . / 1-a ^ s(l-a) , / s(l-a)\\ das^ 

—f{a) = s In h In -^ - In 1 !^ + 



da^ V a 1-s V l-« // l-2s2 + (as)2' 

Note that /'(s) = ^_JJj'^^4 > 0. Forao = s+t where i = s^ln-^/"^ d,it\\o\A?,thdl\n{aQ) > lns+V(2s), 



ln(l - ao) < 0, In ^%^ < In s and In ( 1 - ^^^ ) > 2s. Also, it is direct that ^_^^lfusY - i^*^" 



Thus, for sufficiently large d, we get that 



/ ao < -i/2 + 2s2 + Srs^ < -2—-^ + 13—^ < 0. 

a^ d^ 
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By the fact that /' is continuous in the interval (0, 1) and f{s)f{ao) < 0, there exists at least one 
a' G {s,s + In^/^ ds'^) such that f{a') = 0. Also, it is direct that /(a) > 0, for a < a', and /(a) < 0, 
for a > a'. This implies that there is exactly one point in this interval where /(a) = 0. 



Proof of Lemma |6] We work in the same manner as in the proof of Theorem 7.4 in |[22|, i.e. we apply 

^ and e, > 7^^ 



the Talagrand's large deviation inequality. Let €2 = 'i^^hrj^ and ei > T ^"^,^^ - Using Lemma [3] it is 



direct to show that k^,^ = 2n(l — 0^(1)) In d/d. It holds that 

Pr(«(G;„) < h, - l)Pria{Gl^J > h,) < exp (-(A:,, - K, + 1)^4^,) (9) 



It is easy to show that k^^ ~ ^ei is not too small. In particular, setting ri = 7w In d/d and r2 = £2, it is 
easy to see that k^^ — k^-^ > k^^ — kr^ . 

Claim 2 Let n = 1^^ and ra = 3^^^. It holds that 

d 
The proof of the claim appears after this proof. Using the claim above and (|9]) we get 

Pr(a(G;„) < fc„ - l)Fr(a(G;.,„,) > fc„) 

< exp fJ4fe-2..y)^\ _ ,^p / ,(.. - 2.0^n 



8nlnd/d / \ dlnd 

The theorem follows by combining Lemma [8] with the above inequality to get that 

PrHGl^m) < ke, - 1) < exp (-^^l-^n + WO^-^nj < exp 



'' -.n 



49dlnd 







Proof of Claim|2t Lemma [3] suggest that kr^ , kr^ > 1.8 In d/d. Let f{s) = lim„_5.oo - In E[Xn,m{iT'S)] 
for s G (0, 1). f{s) is continuous in (0, 1). For convenience we let si = fcr^/n and S2 = kr^/n. By the 
Mean Value Theorem, there exists s G [si, S2] such that 

f{s2) - fjSl) ^ f,. 
S2- Si 

It holds that 

ds 

= — In s + ln(l — s) — dj 



''/(^) = i{-slns-{l-s)ln{l-s) + pn{l-s^)) 



For sufficiently lai^ge d, there exists x G [1.8,2] such that s = xlnd/d. This implies that —2lnd < 
f{s) < — ^ In d. Using the initial equality, we get that 

^ ^ _ /(^2)-/(.l) 

''-'' - m 

^ nsi -r2S2 ^ 2 ^1 ~ ^^2 ^ (n -2r2) 
Ind - ^^ 0.9 In d " d 

The penultimate inequality follows from that fact that /c^i , kr^ are very close to each other, which implies 
kr2 < 2kr^. For the last inequality we the fact that si > 1.8 In d/d. The claim follows. 
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Figure 1 : The short chains 




Figure 2: o", r with Property F 



5 Proof of Theorem [I] 



Instead of the random graph model G(n, m) we consider the model G{n,p), where p = d/n for fixed 
real d and we prove the following theorem. 

Theorem 9 There is e^ — )• such that Sk{G{n, in)) is 0{l)-connected w.h.p. for any 

1 I . In d 

K<{\ - ^d)—f- -n. 

Theorem [T] follows from Theorem|9]by using standard arguments (see Proposition 1.13 in 123). 

Remark. For simplicity, we assume that for two adjacent independent sets their Hamming distance is 
less than 20d. 

Definition 5 For every vertex u in Gnp we let N^ denote the set vertices which are adjacent to u. 

A sufficient condition for establishing the connectivity of Sk{Gn^d/n) is requiring this space to have the 
so called Property F: 

Property V. For any two (ai^bitrary) cr, r G Sk{Gn^d/n) we can find chains a, a', a" and t,t',t" of 
independent sets in Sk{Gn.d/n) ^nd Sk+i{Gn.^d/n) connected as in Figure[T] Furthermore, we have that 

a", t" G Sk{Gn4/n) and dist{a" , t") < dist{a, r). In particular it holds that \a" D t"\ = |o- n r| + 1. 

Theorem |9] will follow by showing that with probability 1 — o(l) Sk{Gn^d/n) has Property F. For two 
independent sets a, r G Sk{Gn^d/n)^ we introduce the notion of "augmenting vertex", which is very 
closely related to Property F. 

Augmenting vertex. For the pair a,T ^ Sk{Gn^d/n) the vertex ti G y\((7 U r) is augmenting if one of 
the the following A, B holds. 

A. iv:„ n (o- U r) = 

B. A^t, n 0" n r = and there are the terminal sets /t,(cr) and Iv{t) of size at most Id such that 

• iV„n4(cr) = 

• Iv{cr) is an independent set of Gn^d/n 

• \h,{a)\ = \N„na\ 

• yw £ Iy{a) it holds that {N^ n cr| = 1 and \Nyj n iV„ n o-| = 1 

The corresponding conditions should hold for Iv{t), too. 
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Figure 3: The independent sets o"', r'. 
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Figure 4: Final sets 



Figure [2] sliows an example of a pair of independent sets where the vertex v is an augmenting vertex. 

The notion of "augmenting vertex" and the Property F are closely related since we will show that for 
a pair o", r G Sk{Gn^d/n) that has an augmenting vertex v we can find short chains a, a' , a" and r, r', t" 
as in Figure [U Thus, if we can find an augmenting vertex for any two members of Sk{Gn^d/n)' then 
Sk{Gn,d/n) has Property T. 

In what follows we introduce the process called Collider which takes as an input a, r and the 
augmenting vertex v and returns the independent sets a" and t" of the chains. 

Collider {a, r, v): Let V^, V^ be the set of vertices in a and r , correspondingly. 
Phase 1. 

1. Remove from Va all the vertices in A'^j, n a. 

2. Insert into V^ the vertex v and the vertices in /„ (a) q 

3. Do the same for V^. 

Phase 2. 

1. Remove some vertex from V^ that is nor v neither some vertex in o" n r. 

2. Do the same for Vr- 

3. Return the sets Va and Vr- 

End 

Figure[3]shows the changes that have taken place to the independent sets in Figure|2]at the end of "Phase 
1" of Collider((T, r, v). The independent sets a' and r' in the short chains correspond to the vertex sets 
Va and Vr at the end of the "Phase 1". Both a', t' now contain the augmenting vertex v, moreover, it 
holds that a' n r' = (a n r) U {v}. The independent sets a" and t" correspond to the sets Va and Vr 
that are returned by Collider((j, r, v). After "Phase 2", the independent sets in Figure [3] are transformed 
to those in Figure |4l There the vertices U2 and u-j are removed from a' and r', coiTcspondingly. 
In the following lemma we show all the above properties of Collider. 

Lemma 9 Let a,T G Sk{G) with augmenting vertex v. Let a" and t" be the two sets of vertices that 
are returned from Collider(o", r, v) . The two sets have the following properties: 

L a",r"G5fc(G), 

2. \a" r\T"\ = |crnr| + 1, 

3. There are a' ,t' G Sk+i{G) such that a' is adjacent to both a and a" while t' is adjacent to both 
T and t". 



hfN^niV^lJVr) 



I then we set Ivio') 
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Proof: First we show that V^ and Vr, as returned by Collider {a, r, v), con^espond to independent sets 
of G. The same arguments apply for both y^ and Vr, we consider only the case of V^. 

Let V be an augmenting vertex for the pair a, r. If A'^y n (cr U r) = 0, then it is direct that the set V^, 
at the end of the process, is an independent set of G. Consider, now, the case where N^ Ci {a U t) y^ 0. 
Assume that Vo-, at the end of the process, does not correspond to an independent set of G, i.e. there is 
an edge between two vertices in V^. Then, the edge must be either between the new vertices that are 
inserted in y^ (i-C- v and Ii,{a)) or between some newly inserted vertex and an old one. 

The first case cannot be true since the assumption that v is an augmenting vertex implies that there is 
no edge connecting v with /^((t), or vertices in Ii,{a) with each other. As far as the second case is con- 
sidered note that both v and Iv{a) have the same neighbours in a. During the process Collider{a^ r, v) 
all the vertices in ¥„ that are adjacent to v and Iv{(y) are removed. The second case cannot occur. Thus 
the sets Va and Vr, as returned by the process Collider {a, r, v), correspond to independent sets of G. 

It is direct to show that after the execution of Collider{a, r, v) the sets Va- and Vr are both of size k. 
Property 1 follows. 

Property 2 follows by noting that |cr" n t"| = jo" n t| U {v }. As far as the property 3 is regarded it is 
easy to see that a' corresponds to the set V^ exactly at the end of "Phase 1". We argue similarly for r'. 


We are going to use the first moment method to show that with probability 1 — o(l), the graph 
Gn,d/n has no pair of independent sets in Sk{Gn^d/n) with no augmenting vertex. This implies that with 
probability 1 — o(l) the set Sk{Gn^d/n) has Property T and Theorem |9] follows. 

We compute, first, the probability for a pair in Sk{Gn^d/n) to have an augmenting vertex. 

Proposition 3 For a,T £ Sk{Gn^d/n) <^f^d \a D t\ = i < k, let pk^i be the probability for this pair to 
have an augmenting vertex. There is e,^ — t- such that for any e^ < e < 1 — e^ and k = {1 — e)^n the 
following is true 



Pk,i > 1 — 2exp I — 0.8— n 

The proof of Proposition [3] appears in Section ISTT] 

Proof of Theorem |9t Let Zk be the number of pairs of independent sets of size k in Gn,p that do 
not have augmenting vertex. Theorem follows by showing that E[Zk] = exp (— 0.5^n) and then by 
applying the Markov inequality. It holds that 

\ fn\ /k\ fn — k\ , ■,<){''']_{*'] , 

Pk,i) 



. .e,p(-„4„)t0Q(;-)a-„^(')-«) 



=0 

where the last inequality follows by Proposition |3] Let X^ be the number of independent sets of size k 
in Gn^d/n- It is direct that 

E[Zk] < 2exp{-0.8d'n/d)E[Xl]. 

By the relation in (7.10) in 1221 (page 182) we have that 

E[Xl] < {E[Xk]fke^p{k/2ln^d) 

< exp (4nln d/d) A; exp (n/dln d) < exp (5nln d/d) . 

It is clear that for sufficiently large d, and 3^^^ < e < 1 we have E[Zk] < exp (-O.Sf n). The 
theorem follows. 
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5.1 Proof of Proposition |3] 

Given an arbitraiy pair a,T e Sk{Gn^d/n)' we are going to construct an appropriate set Qo containing 
vertices that each of them can be used as an augmenting vertex of the pair a, r. Propositions [3] follows 
by deriving an appropriate lower bound for i-'r[|(5o| > 0]. First, we need to define some sets of vertices: 

Qi(o") Qi{(t) C V\{a U r) contains those vertices that have with exactly one neighbour in a and they 
do not have exactly one neighbour in r. For the augmenting vertex v G Qq, we require that the 
set of terminal vertices /„(cr) belong to Qi{(j). 

Q2(c) <52(o") ^ o"\t is the set of vertices that have at least one neighbour in Qi{a). Since we have 
required that for any v ^ Qo the associated terminal set belongs to Qi (a), then it is clear that only 
vertices in Q2(c) can be in A^^ n a. 

Q3((j) Let Q3{<j) C V\{a U r U Qi{cr)) be the set with the following three properties: 

Si- \fw e Qsia) it holds A^^ n cj C Q2{a). 

52- |iV«, ncr| < 7d. 

53- For any two x,y £ Qsi^a) it holds that {N^; n iVy n cr| < 1. 

In an analogous manner we define Qi{t), Q2{t) and Q3(t), w.r.t. the independent set rQ 

If any u G Qzi'^) H QsCt) does not have any neighbours in cr n r and we can find appropriate 

terminal sets luic), Iu{t), then u is an augmenting vertex. Qq will consist of all the augmenting 

vertices in Q^ia) n Qsir). 

So as to derive a lower bound on Pr[QQ > 0], first we provide a concentration result for Qsia) n 

Q3{t). Then, we provide a conceti^ation result on the number of vertices in Q^ia) n Qs^r) that are 

augmenting. 

Claim 3 Let Xi = \Qi{a)\. It holds that E[Xi] = ^^~jl['"^ n{l - Od(l)) + 0(1) and 

Pr \Xi - E[Xi\\ > e(n^/^) < 2exp (-n^^'^ 
The proof of Claim [3] appears in Section [5^ 
Lemma 10 For u G V\{(t U r U Qi{a)), it holds that 

Pr[u G Q3(^)] > 1 - ^ '-^^ 2^~ " ^(^~^- 

The proof of Lemma [TOl appears in Section 1531 

For u G V\{a U r U Qi{(t) U Qi{t)) let Ju be the indicator random variable such that J„ = 1 if 
u G Q'i{(j) n Q'i{T) and J„ = otherwise. It is easy to verify that E[Ju] = Pr[u G Q3(cr)]^ for every 
u G V\{a U r U Q\{a) U Qi(t)). Note that the indicator random variables J„ are negatively correlated 
with each other. More specifically, due to condition ^3 it holds that 

Pr[,h = l\Jy = 1] < Pr[J:, = 1] 

since the event Jy = 1, can reduce the choices of x for neighbours in (52(0') and Q2{t). Let X3 = 
J2u -^u- Using Claim[3]and Lemma [TOl we get that 

E[X3] > (1 - Od{l))n{P[v G Q3{a)]f > (1 - Orf(l))n, for v G V\{a U r U Qi((7) U Qi(t)). 



■*Note that it always hold that Qi (cr) n Qi (r ) 
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Since X-^ is a sum of negatively depended random variables, Theorem |7] suggests that 

Pr[X3 < 0.8n] < exp (-3.8n/100) . (10) 

So far we have shown that the size of Qal'^) ^ Qi,{t) is large w.h.p. The proposition will follow 
by showing that many of the vertices in Qi,{cr) n Qi,{t) are augmenting. In particular, every u G 
Q3(c) n Qz{t) is augmenting, i.e. belongs to Qq, if the following two conditions hold: 

Zi. u does not have edge connecting it to cr n r. 

Z2. there exist Ru{cr) C Qi{(t), Ru{t) C Qi{t) which can be used as the terminal sets Iu{c^) and 
Iu{t), correspondingly. 

Claim 4 Each vertex in Qsicr) n Qsir) satisfies Z2, independently of the others with probability 

PA>1- 50(iVn. 



The proof of Claim |4] appears in Section [ 

Let X4 be the number of vertices in Q^ia) n Q^ir) that satisfy condition Z2. Using (ITOl ) and 
Chemoff bounds we get 

Pr[Xi < 0.7n] < exp (-n/1000) . (11) 

Assume that |(T n r| = a ■ k, for some a G [0, 1]. It is easy to show that each vertex in Q3(cr) n Q3{t) 
that satisfies Z2, independently of the other vertices satisfies Zi with probability d""^^"*^^ + 0{n~^). 
WeletB="X4 > 0.7n", it is direct that £;[|Qo||^] > 0.7nd-"(i-^) -0(1). Since a G [0, 1], there exists 
e' = e'(e, a) > e such that a(l — e) = 1 — e'. Applying the Chemoff bounds for Y we get that 

Pr[\Qo\ < 0\B] < exp (-0.7d''n/d) < exp{-0.8d'n/d) . 

By the law of total probability and (ITTI) we get Pr[|(5o| < 0] < exp (— 0.8^n) . The proposition 
follows. 

5.2 Proof of claims Ha 

Proof of Claim |3l The probability for a vertex to be in Qi{cr) is 

, / si- 1 / , , ni. In (1 — e)lnd / (1 — e) Ind ^, 1 , 
PI = kpil-p)'-Hl - kp{l-pf-^) = ^ ^,1 [l - ^-^^ 0{n'\ 

It is direct that 

E[Xr\ = (1 - 2s)np^ = il^ili^n(l - o,(l)) + 0(1). 

Applying the Chemoff bounds we get the claim. 

Proof of Claim m For some vertex u G Qsicr) n Qsir) there is at least one set Ru{o') C Qi{a) such 
that the following is true: Every vertex in Nu n a has exactly one neighbour in Ru{cr). By definition it 
also holds that every vertex in Ru{(y) has exactly one neighbour in Nu n a. 

Assuming that the condition Zi holds for u, then Ru{(y) can be the terminal set /„(cr) iff no two 
vertices in Ru{(y) are connected with each other and no vertex in Ru{cr) is connected to u. 

Since each neighbour of u in Q2{c^) can have many neighbours in Qi{a), possibly, there can by 
many choices for the set Ru{cy). It is direct that the probability for u to fulfil condition Z2 is minimized 
when there is only one choice for Ru{(7) and this set has the maximum possible size. 
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Since the maximum number of neighbours of n G Qsicr) n Q3{t) in Q2(c) is at most 7d we have 
that \Ru{(^)\ < 7d. It holds that 

Pr[there exists edge with both ends in Ru{a)] < ( ) d/n < 49d^ /2n, 
Pr[u is adjacent to some vertex in -R„(cr)] < Id^ /n. 

The same holds for i?„(r). It is direct that 1 - P4 < 2 (49dV(2n) + 7d^ /n) < 50d^/n. 

It remains to show that each vertex in Qsicr) n Qsir) satisfies Z2 independently of the others. Note 
that we have the restriction that no two vertices in Qsicr) (in Q3{t)) shai^e more than one neighbour in 
(52(f) (Q2(t))- Thus, each edge between any two vertices in Qi{a) (or in <5i(r)) eliminates at most 
one vertex in (53(0") n Qsir). The independence follows by the fact that the edge events in Gn^d/n ^^ 
independent. 



5.3 Proof of Lemma [10] 

Before the proof of Lemma [TOl we need a concentration results on the size of the set (52(c). 

For every u G cj let J^ be the indicator r.v. such that J^, = 1 if u E Q2(c) and zero otherwise. 
Conditioning onw £ Qi{cr) the vertices in a are adjacent to w, equiprobably. It holds that 

d ^-^■ 



Let the event A = "Xi > (1 - e(n~V3)) Ilzfi^j^". ^g ^ave that 

7 = Pr[J^ = l\A] > 1 - exp {-d') - 0{n'^). 

Let X2 = ^u£a Ju — 1^2(0") I- It holds that £'[X2|^] > (1 — a)kj. It is easy to see that the random 
variables J^ are negatively depended. Using Theorem |7] we get that 

Pr[X2 < (1 - e(n-i/'^))(l - a)kj\A] < exp (-?i°-^5) . 

Using Claim [3] and the law of total probability we get 

Pr[X2 < (1 - e{7i-^/^))kj] < exp (-n°-^) . (12) 

Proof of Lemma HOl Consider the set Qsicr) C V\{a U r U Qi{a)) which has the same properties as 
Qsia) apart from ^3. First, we derive a lower for Pr[u € Qs]. 

Let a € [0, 1] be such that |(t n t| = ak. For each u G V\{a U r U (5i(cr)) let d^^riu) be the 
number of vertices in a\T which are adjacent to u. If dc^^r{u) = i, then all the subsets of size i in (j\t 
ai-e equiprobably adjacent to n, i.e. Pr[u G Q3{cr)\d„^Tiu) = i] = (^^)/(^^ j"''')- 

For a > 1 — n~^'^ we have that 

PriueQsia)] > Pr[iV„ n (a\T) = 0] = (1 -p)(^-'^)'= 

> exp(-(l-a)fc^)>exp(-n-V4i,rf)>l 1^^ ^'^^ 



n 



1/4' 



Assume now that a < 1 - n^^/^. Let the event A ="^2 > (1 - Q{n~^/^)){1 - a)k'y" where 7 
(1 — exp(— d'^)). Assuming that the event A holds, for z < 7d we get 

i") -f ^^ ^ (i-o(l)) = y(l-o(l)). 



(^(i-a)fc) \{l-a)k 
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Let the event C ="d(j,r(^) 7^ 1 and dfj^riu) < 7d". Note that drj^r{u) is binomially distributed with 
parameters (1 — a)k and p = d/n with the restriction that the event C holds. We have 



Id 



Pr[u G Qsl^] > Y. ^'^[^ ^ QsKrW = i, A]Pr[d„^r{u) = i\A] 



i>0 



Id 



s ^5;^ EP7'1p"(i-p)'-»"-V-7, 



where 

g = (1 - a)A:p(l - p)(i-«)'= < d-(i-^) Ind. 

Note that 

because 7 < 1. Also 

i=7d ^ * ^ j=7d ^^^ 

The last inequality follows from Corollary 2.4 in [i22il . We get that 

> exp (-(1 - a)(l - e)e-'^' Ind - O(n-i)) - exp(-7d) 

(1 — a)(l — e) Ind , „ ,x ^. i^ 

> 1-^ '-^^ exp-7d -On-i. 



By the fact that Pr[C] < 1, and 7 < 1 we get 

^ r A , .1 (1 — a)(l — €)lnd d'^lnd , . ^, 1, 

Pr\u G Qsl^] > 1 - ^ ^dT^ d exp(-7d) - 0{vr^). 

Noting that Pr\A\ > 1 - exp (-G(n°-^)) we get that for < a < 1 - n~^l^ 

Pr\u G Q3(^)] > 1 - ^ ^^^ 2^~ " ^('''^- 

By (fT3] ). it is direct that the above lower bound of Pr\ii G Osl'^)] holds for any a. 

Given that v G Q?,{p^ it is easy to see that it does not violate Sa with probability at least 1 — 0{n 
The law of total probability impUes that 

Pr[u G Q3] > 1 - ^ ^^^ 2 ^~ " ^^''"^- 

The lemma follows. 
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6 Proof of Theorem |2] 

To measure how close a, r G Sk{Gn,m) are, we define 

The function /^ is a map from the set (^^') to the interval [0, (k/n)]. There is a direct correspondence 
between /^(t) and dist{a, r), the Hamming distance between cj, r. The greater /^(t) the smaller the 
Hamming distance between a, t. For a fixed a G S{G) and the real A > let 

ga,G,x{x) = \{t G [n]k : (/.(r) = x) A H{t) < Xn}\. 

Theorem |2] follows as a corollary of the following lemma. 

Lemma 11 There is e^ — )• such that for any e,^ < e < 1 — e^ and A; = (1 + e)^n the following is 
true. There are < yi < y2 < (k/n)'^ and A, 7 > such that w.h.p. a pair (G, a) distributed as in 
Uk{n^ m) has the following two properties. ■ 

1. For all X G [2/1,2/2] we have ga,G,x{x) = 0- 

2. The number of independent sets r G Sk{G) such that fa{T) > 2/2 i^ <^t most exp(— 7n)|5fc(G)|. 

In the above lemma we take e^ > ln~^'^ d. Then, using Lemma[3l Lemma|2]and Proposition [T] suggest 
that w.h.p. \Sk{G)\ > exp(2is^n). 

The above lemma states that w.h.p. a 1 — o(l) fraction of independent sets in Sk{Gn,m) satisfy 1. 
and 2. We partition the space Sk{Gn,m) into subsets as follows: For each a G Sk{Gn,m) we define 

C = {r G Sk{Gn,m) ■ fair) > 2/2}- 

Start at S* = Sk{Gnm) and remove iteratively some €„ for cr G S" satisfying 1. and 2. of Lemma [TT] 
It direct that we remove exponentially many subsets of Sk{Gnm)- Furthermore, each such subset €„ is 
separated by linear Hamming distance from Sk{Gn,m)\CcT- Thus, Theorem |2] follows from Lemma [TT] 
To establish Lemma [TTl we use the planted model. 

Lemma 12 There is e^^ — t- such that for any e^ < e < 1 — e^i and A; = (1 + e)^n the following is 
true There are < 2/1 < 2/2 < (k/n) and A, 7 > such that with probability at least 1 — exp (— ^'t-) 
a pair {G, a) distributed as in Vk{n, in) has the following two properties 

1. For all X G [2/1,2/2] we have ga,G,\{x) = 0. 

2. The number of independent sets r G Sk{G) such that fa{T) > 2/2 is at most exp I — ^— '^ 



Proof: Consider the pair {G, a) G Afc(n, m) chosen according with distribution Vk{n, m). It direct that 
for T G (f^') the probability that H{t) < An, w.r.t. to G, is at most 

exp((2/(A)+o(l))n) 



1 - {k/ny 
where lim;!^_j.o uW = 0- For the rest of the proof assume that we take A sufficiently small such that 
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We prove the lemma by computing A{x), the expected number of subsets r G [nj^ such that H{t) < 
Xn and /^(t) = x, for suitable yi < x < 7/2- Let, also, s = {k/n) and assume that x = s(l — b), for 
6g [0, 1]. It holds that 

22 \ k—xn / -1 n 2 , 2 \ "* 

ens \ /I — 2s + x^ 



(2 \ bsn 
^ J (1 - s2(l - (1 - 6)2))" exp((y(A) + o(l))n) 

< exp [hsn[2 - In 6 - In s - (1 + e) In d(l - 6/2)] + (y(A) + o(l))n] . 

Using the fact that — log s < log d and —qlnq < e^^ for any real g > 0, we get 

ilnSW(x)l < sr2-2e~^+61n(i(l-(l + e)(l-6/2))l +y(A) + o(l). 
n '- -' 

Taking sufficiently large d and sufficiently small A we get that ^ ln£'[^(x)] < for lOln^^ d < b < 
Yh. In particular, for such 6 we get that i?[A(x)] < exp (— ^'^)- 

The independent sets that belong to the same cluster as a are those for which 6 is very small (the 
overlap x is big). We should have 6 so small that 2 — 2e~^ > —b\n.d{\ — (1 + e)(l — 6/2)). Then, it is 
easy to see that i?[j4(2;)] < exp (^^n) . The lemma follows by the Markov inequality. 



7 Proof of Theorem |3l 

Definition 6 For the graph G = {V, E) and an independent set a ^ V in G, a vertex v G V\a will be 
called (j-pure if it is not adjacent to any vertex in a. 

Let (G, a) be a pair chosen from the distribution Vk{n, m). If we condition on all vertices in "P C 
V\a being pure, then this implies that there are no cr-'P-edges, but it does not impose any conditioning 
on the subgraph G [P] induced on V. Hence, G [P] is just a uniformly random graph, given its number 
of vertices and edges. More specifically we have the following: 

Lemma 13 Consider the pair {G, a) distributed as in Vk{n, m)for k = {\ — e)-^n and e G [d~^'^, 1]. 
With probability at least 1 — 5 exp ( — 3^) the "a-pure" vertices in {G,a) induce a graph which is 
distributed as in Gn,m with N > {1 - d~^/^)^n and M < (1 + d^^/^)^n. 

Proof: Let Gn^pic) be the graph Gn,p conditional that a G Sk{Gn,p)- Assume that p = — and D = 

(i-^2)(f_^-./3) > d. For every v G V\a let 



_ J 1 if w is cr-pure in Gn,p{o'), 

^ \ otherwise. 



It holds that £[1^] = (1 - p)^ = (1 - 0(n^i))f . Also, let Xpr = Ene^V -^^- ^^ *^ linearity of 
expectation we have E[Xpr] = {n — k){l - p)^ = (1 — (1 - e) Ind/d - 0{n^^))^n. Applying the 
Chemoff bounds we get that 



Pr 



\Xpr - E[Xpr\\ > nd^^l'^/d < 2 exp {-n/M) . (15) 
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Let the event A ='\Xpr — E[Xpr]\ < ^-^n". Let Ypr be the number of edges which are incident to 

a-pure vertices only. It is direct that £^[y^r.|^] < (1 + 3d~'^'^)^n. Applying the Chernoff bounds we 
get 



Pr 



Ypr>{l + d-'/'^)—n\A 



<exp(-^). (16) 



Let the event 5 ='%,• < {l + d-'/^)^n". By dBJl and ([HI) wegetthat Pr[^, B] > l-4exp (-|^) . 
The lemma follows by bounding appropriately the number of edges in Gnp{cr). Let Y be the number 
of edges in Gn,p{<7)- Clearly, E[Y] = 2-in-rj-'=/3i ^" Using Chernoff bounds it easy to show that 

Pr[Y <m]< exp (-nd'^'/^/4\ . 
The lemma follows. 

Combining Lemma[6]and Lemma[T]we get the following. 
Corollary 3 For the graph Gn,m of expected degree d it holds that 

Pr [a{Gn,rn) > 2n(l - Od{l))\nd/d] > 1 - exp (-^) ■ 

Proof: Consider G* ^ of expected degree do = d{l + 4(i~^/^). Using Lemma[3]and Lemma[6]we get 

that 

2n 



2n 
aiG*^J > — (In do - In In do + 1 - ln2 - 10) 

"0 



> 1 — exp 



do In do 



Pr 

Substituting do = d(l + Ad''^/^) 

Pr[a{G*^.J > 2n(l-Orf(l))lnd/d] > 1 - exp (-n/(d In d)) . 

Then, noting that do(l — 2dQ ) > d we use Lemma[T]to get the corollary. 

If / C P is an independent set, then so is cr U /. Hence, Lemma [T3] and Corollary [3] imply that a 
random pair (G, a) chosen from the distribution T'fc(n,m) is (2e/(l— e),0)-expandable with probability, 
at least, 1 — exp(— n/(dln d)). Finally, invoking Theorem[5J we see that a pair {G, a) chosen from the 
distribution Uk{n, m) enjoys the same property w.h.p., whence we obtain Theorem[3] 

8 Proof of Theorem 1 

Now, assume that A; > (1 + e)n ln(d)/d for e > 0. In this case it is direct to show the for the pair (G, a) 
distributed as in Vk{n,m), the (expected) number of u-pure vertices is bounded by d^^~^n <^ k. 
Indeed, it is not very difficult to show that all but, say, d~^^^' ^n vertices v G V\a have at least O.lln d 
neighbours in a w.h.p., provided that d > do is not too small. Let us call these vertices blocking. Thus, in 
order to create an independent set of size bigger than {l + c)k, for small fixed c > 0, we need to include 
I > ck — d~'^^^/'^n > ck/2 blocking vertices w.h.p. Furthermore, since the bipartite graph spanned by 
a and the blocking vertices is random, it has excellent expansion properties w.h.p. Indeed, via a first 
moment argument one can show that these expansion properties imply that there is no independent set 
r that contains / > ck/2 blocking vertices and ck vertices from a. More precisely, the first moment 
argument yields the following bounds. 
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Lemma 14 There is e^ — )• such that for any e^^ < e < 1 — e^ and A; = (1 + e)^n the following is 



d 

f €-2/Vlnd 



true. For {G, a) distributed as in Vk{n, m) and for 5, 7 G [0, 1] such that either 5 < 7 + 2- — j— ^ — or 
(1 + 7)/c > a{G) we have that 

PVk{n,m.) [^^ {G, cr), a is not (7, 5)-expandable ] > 1 — exp ( —n J . 

Proof: Let (G, a) G Afc(n, in) distributed as in Vk{n, m), where A; = (1 + e)^n and e > 0. Let E be 

the event that o" is (7, 5) -expandable where 5 < 7 + 2 '^~ j^ " . Let ^^(7, S) be the expected number 
of independent sets in G which have size (1 + ^)k and shai^e (1 — 5)k vertices with a. For s = |cr|/n 
we have 



\2„2 I t^ x\2„2\ m 



/c \/ n-k \/l-s^-(l + 7)^s^ + (1 - (5)^s 



{l-6)kj \{l + -i)k-{l-5)kj V 1-s 

k \ ( n \ f ^ ^ f 1 , 7 ~ ^ \ / ,!:\2 



.2 



By the fact that —x In x < e~^ for x > and — hi s < In d we get that 

ihi^<,(7,5)<i:^ + (7 + <5)+e-i-(7 + 5)ln(i((l + e)(l + ^)-l). 

For 6 = 6{q) = 7 + 2^^, for some g > 0, we get that (1 + (7 - 6)/2){l + e) - 1 = q. Thus 

^ In ^,(7, 5) < 4 - 2 ("7 + i^^) q\nd. 

Forg > 2/\/imiande > 10/\/lnd we get ilnA^(7,(5) < -1. 

It is easy to show that for given 7 the expected ^^(7, 6) decreases as 6 decreases. Thus, the lemma 
follows by noting that for 5 = 7 + 2^=^^^ it holds that Pr[E] < nA^{-f, 5). 

Theorem |4] follows by using Theorem |5l 

9 Proof of Corollary ID 

For the Metropolis process described in Section [L2l it is easy to see that every state in J7 = |J^„ Sk {G{n, m)) 
communicates with every other. Thus the process is ergodic and possesses a unique stationary distribu- 
tion. Let vr : il — > [0, 1] denote the stationary distribution of the Metropolis process with parameter A, 
for some A > 0. It is easy to show that 7r(cr) = \\"\/Z where Z = J2aen ^''^'■ 

Here, we are interested in finding the rate that the Metropolis process converges to the equilibrium. 
There are a number of ways of quantifying the closeness to stationarity. Let P*((j, •) : — )• [0, 1] denote 
the distribution of the state at time t given that a was the initial state. The total variation distance at 
time t with respect to the initial state a is defined as 

A.(t) = max|P*(a,5) - n{S)\ = i J] |P*(a,r) - 7r(r)|. 
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Starting from a, the rate of convergence to stationarity may then be measured by the function 

Ta = min{A^(i') < e~^for all t' > t}. 

We define the mixing time of the Metropolis process as 

T = max To-. 

Before showing Corollary [J we provide some auxiliary results. In the following proposition we show 
that for a given parameter A the stationary distribution of the metropolis process concentrates on a small 
range of sizes of independent sets. 

Proposition 4 Let I be an independent set chosen from the stationary distribution of the Metropolis 
process on the independent sets ofG{n, m) with parameter A as in (|21). Letting 

K = {k:\ii{Gn,mA)-k\<'in/d}. (17) 

we have that Pr[\I\ ^ K] < exp ( -^^n 



d2 

The proof of Proposition |4] appears in Section 19^1] 

Lemma 15 G(n, m) has the following property w.h.p. Let K be as in di7D . the set Ufce-ftr ^k{G) admits 
a partition into classes Ci , . . . , Cn such that 

L Pr[I € Ci] < exp{—n\nd/d) for each i < i < N. 

2. We have that Pr[I € Ui<i<ArCj] > 1 — exp {—nlnd/d). 

3. The distance between any two independent sets in different classes is at least 2. 

The proof of Lemma [T5] appears in Section |9^ Corollary [T]follows from the following lemma. 

Lemma 16 Consider the Metropolis process on the independent sets ofG{n, m) with parameter A as 
in ([2]). W.h.p. the mixing time of the process is greater than e"'''^ . 

Proof: Let K be as in dTTl ) and assume that Gn,m is such that \^k£K'^k{G) has a partition Ci, . . . ,Cn 
satisfying 1, 2 and 3 of Lemma [T5l 

The proof is going to be made by contradiction. Assume that our Metropolis process has mixing 
time T < e"/'^ . Let Xj be the state of the Metropolis process at time t > 0. 

Consider, now, the time interval [^1,^2] such that ti = n^T and ^2 = In^T. It is direct to show 
that the distribution of Xj, for t G [^1,^2] is extremely close to the stationary distribution, i.e. for 

2 

any t G [ti,t2] the total variation distance between the distribution of Xt and vr is at most e~" . By 
Proposition m Lemma [T5] and standard arguments about mixing time, it follows that for any t G [^1,^2] 



Pr 



Xt^ U ^' 



1<«<7V 

Applying the union bound we get that 



<2exp|-'""''";-°">' 



Pr 



3h<t<t2:lt(^ U ^* 



l<i<N 



, (l-o(l))ln'^d ,A ( In^d 

<2exp( -^ ^-^ n^nld^\ <expf-^^n 
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(18) 



The above relation implies that so as the Metropolis to get from X^^ to Xt^ w.h.p. it uses only independent 
sets which are in Ui<i<Af ^*- 

Let us exam now the states Xt^ to Xt^. We should observe that since the interval [^1,^2] is so long, i.e. 
n^T, the two states are (almost) independent of each other. Using Proposition |4] and (1) from Lemma 
[15] we get the that 



Pr^i.i E [iV],i / i : Xt^ G Ci KXt^ G C^] > 1 - 4exp(-nlnd/d). 



(19) 



Now (3) of Lemma [T5] causes the problem. For assume that there are two distinct i, j G \N\ such that 
Xi^ G Ci and Xt^ G Cj. Let t > ti be the first time that Xi ^ Cj. Then by definition we should have that 
dist(Xt,Xt^i) < 1. This implies thatXj ^ [Ji^^Ci because otherwise there would be two independent 
sets in different classes at distance one. Thus we conclude that 



Pr[3i,j G [iV], i / i : Xt^ G d AXt, G Cj] < Pr 
contradicting (fTSl ) and ( |T9l ). The lemma follows. 



3ti<t<t2:Xt^ U d 



l<i<N 







9. 1 Proof of Proposition |4] 

For the Metropolis process on the independent sets of Gn,m with parameter A, we let 

R{k,\) = \Sk{Gn,m)\X^. 

In essence, R{k, A) is the total weight carried by the independent sets of size k in Gn,m- For c G [0, l],we 
denote with Ac the value of the parameter of the Metropolis process that maximizes E[R{k,X)] for 
A: = (1 + c)^n. First, we show the following lemma. 

Lemma 17 For c G [0, 1] and A; = (1 + c)^n, it holds that 



Pr 



R{k,\c) < exp -UnJln^d/d^ E[R{k,\c)] 



< exp (— nln d/d 



Proof: Note that R{k, Ac) = {Xc)''X{k), where X{k) is the number of independent sets of size k in 
Gn,m- It is easy to see that using Proposition [T] we can get a lower bound on R{k, Ac), since we only 
need a lower bound on X{k). The lemma follows. 

Proposition |4] follows as a corollary from the following lemma. 
Lemma 18 For some constant c G [0, 1], let and k = {1 + c)-^n and 



Ro{\c) = exp -UnJln^ d/d^ E[R{k, X. 



It holds that 



Pr 



k':\k-k'\>^ 



R{k', Ac) > exp i-n/d) Ro{X) 



< exp (— 3n/d) 
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Proof: For convenience we write k, the size of the independent sets, in the fom|j 

2n 
k-e = — (lnd-lnln(i+l-ln2-e). 
a 

where e can vary so as A;„e < a{Gn,m)- Assume that for eo it holds that k-^g = (1 + c)^n. For 
convenience we define the following functions: 

• y(e) = (Ind-lnlnd+l -ln2-e) 

. /i(e) = |y(e)(lnA, + e-ln(y(e)/lnd)). 

Using Lemnia[3]and Lemma|2l we get that for any e (not necessarily eo) it holds that 

1 . „_ In^d 



h{e)<-lnE[R{k.„X,)]<h{e) + 3 ,„ 
n a^ 

First we need to compute a lower bound of In Ac with respect to eo- It is easy to get that 
Letting e' be such that /i'(e') = we get that 



It is direct ot check that e' is the maximum of the function h. For e = e' + q where \q\ < (In d)^' ^ we 
get that /i(e) < /i(e') - ^q"^. 

We let /(e) = ^ ln^[i?(A;_„ A^)]. Since /(e) < /i(e) + 2^ itis easy to see that |eo -e'| < d^^/s. 
Otherwise, i.e. if |eo — e'| > d"^/^ then we would get that /(eo) < h{e'), which is a contradiction. All 
the above imply that 

log Ac > 2/(eo) + 1 - eo + In T^") - 5d-^/'. (20) 

It holds that 



d \ \ \lnd J J \ \ Ind J J d 

Setting t = e — eo, then for |i| < 5 we get that 



d \ \lnd J \ Ind J d 

Using y(e) = y{eo)-t, we get ln(y(e)/ In d) = ln(y (eo)/ In d) -t/y (eo) -t7( V (eo)) -0(tV2/'(eo))- 
Then we get that 



/{f)-/(«o) < I 



<(_,„,,+ „.„) + 1 _.„+,„ I f^) - o«v.=(.o))) -*i^-^)- ^^ 



Using dinil and the fact that 1/2 < f^ < 1 we get 



logd 

/(e)-/(6o)<^[5/lnd-i2] 



^The reader should note that this parametrization of the size of independent sets is different than the one in Definition|4] 
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For sufficiently large (i and e such that I e—eo I = 1 wegetthat£'[i?(A:_e, Ac)] < i?[i2(A;_eo, Ac)] exp (— 1.9?i/d). 
By the Markov inequality we get that 



R{k^eAc) > exp -15nJln^d/d^ E[R{k_^^,Xc)] 



Pr [R{k-„ Ac) > exp {-n/d) Ro{Xc)] < Pr 

< Pr [Rik^e, Ac) > exp {3n/{2d)) E[R{k^„ Ac)]] 

< exp(-3n/(2(i)). 

It is easy to see that for |t| > 1 the above probability is even smaller. The lemma follows. 

9.2 Proof of Lemma [m 

LemmafTSlfoUows from the following lemma. 

Lemma 19 Let {G, a) G A/c(n, m) be distributed as in Uk{n, m), for k £ K, where K and fJ,{G, A) 
is as in ([17\) and (|2]), respectively. The set UfeG-ft' '^kiG) admits a partition into classes Ci, . . . , Cn such 
that 

1. Pr[a € Ci\ < exp{—nln d/ d) for any i G [N] 

2. Pr[a ^ Ui6[JV] ^i\ ^ exp{-nlnd/d) 

3. The distance between two independent sets in different classes is at least 2. 

To see that Lemma \T5\ follows from Lemma [I9j first note that for the pair (G, a) distributed as in 
Uk{n, m) and any A C 2^"!, it holds that 

Pr[a G A] < maxPr[X G A\\I\ = k] 

for any k. Given Lemma [T9] we have the following: The statement 1 in Lemma [T5] follows by noting 
Pr\I G Ci] < maxfcgii' Pr[X G Ci\\I\ = k] and using the statement 1 in Lemma [T9l Similarly, the 
statement 2 of LemmafTSlfollows by noting that Pr\I ^ UiefAf] ^«] — ^^^fcei^" Pf^^ ^ UiefAri ^i\\^\ — 
k] and using statement 2 in Lemma [19] The statement 3 in Lemma [15] and statement 3 from Lemma [19] 
ai^e identical. 

So as to show Lemma [19] it is sufficient to prove the following lemma and invoke the transfer 
theorem. 

Lemma 20 Let (G, a) be distributed as in Vk{n, m), with k £ K and K and n{G, A) is as in ([77]) and 
([2]), correspondingly. Let Sa{x,t) C St{G) contain the independent sets having xk common vertices 
with a, where x G [0, 1] and integer t > 0. 

L There are constants < yi < y2 < 1 such that for any yi < x < 1/2 it holds that 

Pr [Ue/r 5^(^,i) = 0] > 1 -exp(-nlnd/d). 



2. Pr 



Y.t£K I Ux>j/2 'S'(T(2;,i)l > exp(8nlnd/(i) < exp(-5nln(i/(i). 



It is easy to see that given the statement 1 of Lemma [20] the transfer theorem implies the statements 2 
and 3 of Lemma [19] The statement 2 of Lemma [20] combined with the transfer theorem and the concen- 
tration result of the number of independent sets of size k in Gn,m we get the statement 1 of Lemma [19] 
It remains to prove Lemma [20] 
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Proof of Lemma |20l As far as the first statemtent of the lemma is concerned, consider a pair (G, a) S 
Ak{n,m) distributed asin'Pfc(n,?7i), with k ^ K. We are going to show that there is a range [yi, 7/2] such 
that for any x G [yi , 7/2] and any t G J^ it holds that E[S{t, x)] < exp(— 2n In d/d). Then statement 1 of 
the lemma follows by noting that X^^g^ E[Sc^{x)] < exp(— 3/2nlii(i/d) and by invoking the Maitov 
inequality. 

For convenience we set s = (k/n) and we write t in the form t = ak for appropriate a. So as to 
have t £ K there exists a constant q, such that a G (1 — q/ln d,l + q/ln d). Clearly, it holds that 

r.r./ M {k\{n-k\(l-{k/nf-{t/nf + {sxf\'^, 
Substituting t = akin the above inequality we get 

Thus we get 

llnE[A{x,t)] < x[l-lnx] + (a-x)[l-lns-ln(a-x)] -(l + e)(a-2;)ln(i^ 
< 3 + (a-x)lnd[l-(l + e)^] 

where in the second derivation we used the fact that — In s < In d, —qlnq < e~^ for any real q > and 
for sufficiently large dO < a < 1.2. 

It is clear that for given e and a such that t G K in the r.h.s. of the last inequality, above, we can find 
an appropriate range for x such that ^ In E[A{x, t)] < —2. The statement 1 of the lemma follows. 

We now give the proof of the statement 2 of the lemma. Using the above inequalities, it is easy to 
see that we can take sufficiently large x so as to have ^ ln£'[^(x, t)] > 0. In particular, it should hold 
that (a — x) lnd(l — (1 — e)(a + x)/2) < 3. In this case it is easy to show that ^ ln£'[^(x, t)] < 3. 
Using the Markov inequality we get that 



Pr 



y^ A{x,k) > exp(8A;) 

x>y2 



< exp{—5k). 
The statement 2 of the lemma follows. 



10 Large deviations results 

Corollary 4 There is a constant do > such that for all d > do there is a number ^ = ^(d) > such 
that for any In d/d < q < 2 In d/d the following is true. Let I be the number of independent sets of size 
\an\ in G{n, m). Then X < exp{—S,n) ■ EX w.h.p. 

Proof: Let G be a graph chosen from the following 'planted' distribution: any edge that does not 
connect two vertices from 5 = {1, . . . , \a7i\ } is present in G with probability q independently, where q 
is chosen so that the expected number of edges equals m. That is, 

m m d 



(2)-(T) (2)(1-"^) n{l-a^y 

Let X be the number of vertices in v £ S that are isolated in G. The expected degree of each v £ S 

equals 

, l-a d 

7 = q[l — a)n = d 



1-q2 1 + a' 
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Moreover, the degree of each v ^ S is binomially distributed. Therefore, each v ^ S is isolated with 
probability (1 — 6d) exp(— d/(l + a)), where (5^ — ;■ for large d. As the degrees of the vertices v ^ S 
are mutually independent random variables, we thus see that X is binomially distributed as well and 

EX > an ■ (1 — Sd) exp{—d/{l + a)) > anex.p{—d{l — a + a )) > 100nexp(— d), 

provided that d > do is sufficiently large. Furthermore, as X is binomially distributed, Chemoff bounds 
yield a number ^ = ^{d) > such that 

P[X < 2nexp(-d)] < exp(-3^n). (21) 

On the other hand, let Y be the total number of isolated vertices in G{n,m). Since the degree of 
each vertex in G{n,m) is binomially distributed with mean d, we have EY < nexp(— d). Further- 
more, a standard second moment computation shows that Var(y) = 0{n). Therefore, by Chebyshev's 
inequality Y < 1.9nexp(— d) w.h.p. 

Thus, let £ C Afc(n,77i) be the set of all pairs {G,a) such that G has at least 100nexp(— d) 
isolted vertices. We have shown that P-p [£]>! — exp(— 3^n), while Fy [£] = o(l). However, if 
P [X > exp(— ^n)EX] > e for some fixed e > 0, then by the same argument as in Theorem |5] we 
would obtain P-p [-'£] > (1 — o(l))eexp(— ^n), which is a contradiction. Hence, we have shown that 
P [X > exp(-^n)EX] = o(l). 
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